ROOTS OF QUATERNION STANDARD POLYNOMIALS 
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Abstract. Here we present a reduction of any quaternion standard poly- 
nomial equation into an equation with two central variables and quaternion 
coefficients. If only pure imaginary roots are in demand, then the equation is 
with one central variable. As a result of this reduction we obtain formulas for 
the solutions of quadratic equations. Another result is a routine for analyti- 
cally solving cubic quaternion equations assuming they have at least one pure 
imaginary root. 



I. Introduction 

Let D be a division ring. By a standard polynomial we mean a polynomial of 
the form a n z n + ■ • • + a\z + a^, where a^, . . . ,a n G D, i.e. a polynomial where 
each monomial is a product of a coefficient (from the left) and the variable to 
some power (from the right). Polynomials of a more complicated form, such as 
f(z) = izjzi + z 2 j — zi + jz(ij — 5) + 6i — 9, are not dealt with here. 

The ring of standard polynomials is denoted by Dl[z]. L stands for "left", due 
to the standard form, in which all the coefficients are on the left-hand side of the 
monomial. These polynomials are also known as "left polynomials". (See Gordon 
and Motzkin paper [2] or Lawrence and Simsons's survey [6]) 

The element z in this ring is assumed to commute with all the elements in D. 
When substituting an element d — > z we follow the standard form written above. 

The substitution map Sd '■ Dl[z] — > D is not a homomorphism for non-central 
d G D, for example: If f(z) = az and ad ^ da then Sd(f 2 ) = Sd(a 2 z 2 ) = a 2 d 2 
while S d (f) 2 = S d (az) 2 = (ad) 2 ^ a 2 d 2 . 

In [2] Gordon and Motzkin proved (among other things) that standard poly- 
nomials of degree n have either an infinite number of roots or no more than n. 
Previously, Niven proved the special case of n = 2 and D = M in [7] . 

If a G D is a root of f(z) G D L [z] then f(z) = g(z)(z—a) for some g(z) G -Dl[Y]. 
However, if a, b G D are two different roots of f(z) then there does not necessarily 
exist any g(z) G Dl\ z \ such that f(z) = g(z)(z — a)(z — b). For example, % and j 
are two distinct roots of f(z) = z 2 + 1, but f(z) is not a multiple of (z — i)(z — j). 

Still, we have Wedderburn's decomposition method (see [8]): If f(z) = g(z)h(z) 
and a is a root of f(z) but not of h(z) then h(a)ah(a)~ 1 is a root of g(z). Con- 
sequently, if n = deg(/) distinct roots of f(z) are known then we can factorize 
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f(z) completely to linear factors. The opposite is not true, i.e. there is no simple 
algorithm for finding the roots of a polynomial knowing its factorization. 

Up to now, the most general form of standard polynomial equation to be solved 
analytically is the standard quadratic equation z 2 + az + b = over H, which 
was solved in 2005 simultaneously by Huang and So (in [3]) and Au-Yeung (in 
PQ), both providing explicit formulas. Here we show how a very similar routine 
is a result of a more general theorem which concerns all quaternion standard 
polynomials. 

Furthermore, we describe a routine for solving cubic equations analytically 
assuming they have at least one pure imaginary root. 

In [1] Janovska and Opfer reduce the problem of solving any standard quater- 
nion equation of degree n to a real equation of degree In. However, for the case 
of n = 2 it is not the optimal, since there are reductions into equations of degree 
3 instead of 4 ([3j, [1]). For the case of n = 3, the corresponding real equation is 
of degree 6 and in general it is analytically unsolvable, even though for numerical 
purposes it is good enough. 

2. Roots of a quaternion standard polynomial 

Let F be a field of characteristic other than 2 and Q = (a, /3)2,f — F[i,j : i 2 = 
a,j 2 = (3,ij = —ji] be a division quaternion algebra, where a, (3 G F. Every 
element c G Q is of the form c = C\ + CjZ + Cjj + Cijij where Ci, q, Cj, G F. The 
real part of c is 9ft (c) = c\ and the imaginary part is 3(c) = Qi + Cjj + Cijij. An 
element is called pure imaginary if c = 3(c) , or equivalently if 9ft (c) = 0. 

The standard (symplectic) involution carries c to c = c\ — c%i — Cjj — c^ij and 
N(c) = cc is in F . Furthermore, a noncentral element c G Q is quadratic (i.e. 
c 2 G F) if and only if dt(c) = c\ = 0, i.e. c is pure imaginary. 

The set of all quadratic elements in Q forms an F-subspace of Q, V = Fi + 
Fj + Fij = {z G Q : 9ft(z) = 0}, which is also the space of all pure imaginary 
elements. For any z G V, N(z) = — z 2 . 

Let there be a monic polynomial f(z) = z k + a fc _ 1 2; fc_1 + ■ ■ • + aiz + a G Ql[z] 
where a^-i, . . . , a G Q and a 7^ 0. 

Setting z = r + a; and iV = —x 2 , for any / G -Dl[z] there exist unique polyno- 
mials g,h G <3l[t, iV] such that /(z) = g(r, N)x + h(r, N). 

The substitution satisfies /(z ) = p(K(^o), N(S(z )))S(z )+/i(K(Z ), N(3(z ))) 
for any z G V. 

Theorem 2.1. Given an element zq = r$ + Xo G Q such that — Xq = Nq & F, zq 
is a root of f(z) if and only if one of the following 

(1) (r , iV ) is a solution to both h(r, N) = and g(r, N) = 0. 

(2) (r , iVo) is a solution to the equation —g{r, N)g(r, N)g(r, N)N = h(r, N)g(r, N)h(r, N) 
and x = -g(ro,N y 1 h(r , N ). 
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Proof. If (r , N ) is a solution to both h(r, N) = and g(r, N) = then f(z ) = 
g(r , N )xo + h(r , N ) = Oxo + = 0, i.e. z is a root of f(z). 

If (r ,N ) is not a solution to both h(r,N) = and g(r,N) = while z$ is 
a root of /(z), then (r ,N ) a solution to neither h(r,N) = nor g(r,N) = 

0. In this case g(r ,N )xo = —h(N ), and therefore g(r , N Q )g(r , N Q )xo = 
N(g(r , N ))x = -g(r , N Q )h(r , N ). 

Consequently -N(<?(r , N )) 2 N = g(r , N )h(r , N )g(r , N )h(r , N Q ), 

1. e. -g(r 0} N )N(g(r ,N ))N = g{r 0) N )h(r , N )g(r , N )h(r 0l N Q ). This is 
surely not the trivial equation, because the difference between the lowest degree 
among the nonzero monomials of the right-hand side of the equation and the low- 
est degree among the nonzero monomials of the left-hand side of the equation is at 
least 1. Consequently, (r , iVo) is a root of the equation g(r, N)g(r, N)g(r, N)N = 
h{r,N)g(r,N)h(r, N). □ 

One result of this theorem in this form is the formulation of the solution to a 
quadratic equation. 

3. Solving quadratic equations 

Both [1] and [3] provide methods for solving quadratic quaternion equations. 
Here we shall show how a very similar method can be obtained not by applying 
Theorem EU 

Let f(z) = z 2 + az + b. By replacing z with z — we may assume that 
9ft(a) = 0. The case of a = is simple. If b is not central then it is enough to 
solve the equation in the field F(b), and if b is central then the equation can be 
solved in any field -F(A) for any A G Q. 

Therefore we assume a ^ 0. Now, since 9ft(a) = 0, b = m + na + d where 
m,n G F and ad = —da. The case of d = is again simple, because it is enough 
to solve the equation in F[a\. Consequently we shall assume that d ^ 0. 

Theorem 3.1. If a,d ^ then for each root zq of f(z) = z 2 + az + m + na + d, 

(1) //n / then ro = 9ft(-2o) is a solution to the equation 16r 6 + (—8a 2 + 
lQm)r 4 + (-a 2 (4m-a 2 ) + 4a 2 n 2 + 4d 2 )r 2 -a i n 2 = and ^s(z ) = -(2r + 

G )~ 1 (2^ a ( r o + ^)( 2r o + a) + ^)- 

(2) If n = then either r = 0, N = — < ^(z ) 2 is a solution to the equation 

= N 2 + (a 2 - 2m)N + m 2 - d 2 and^s(z ) = -a _1 (m + d- N ), or r is 
a solution to the equation = 16r 4 + (—8a 2 + 16m)r 2 — a 2 (4m — a 2 ) + 4d 2 
and ^s(zo) = — (2r + a) _1 (|a(2r + a) + d). 

Proof. f{z) = {x + r) 2 + a(x + r) + b = (2r + a)x + r 2 — N + ar + 6, which means 
that g(r, N) = 2r + a and h(r, N) = r 2 — N + ar + 6. 

g(r, iV) has no solution (ro, iVo) in F x F, therefore for each root zq = ro + xq of 
f(z), (r , iV ) is a solution to -g(r, N)g(r, N)g(r, N)N = h(r, N)g(r, N)h(r, N). 

We shall solve this equation then. — (2r + a)(2r — a)(2r + a)N = (r 2 — iV + 
ar + 6)(2r - a)(r 2 - + ar + 6). 
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Taking only the part of the equation which anti-commutes with a we obtain 

= d(2r — a)(r 2 — iV + ar + m + na) + (2r — a)(r 2 — iV + ar + m + na)d = 
((2r + a)(r 2 — iV — ar + m — na) + (2r — a)(r 2 — N + ar + m + na))d 

Which means that = (2r + a) (r 2 — AT — ar + m — na) + (2r — a) (r 2 — iV + 
ar + m + na) = 4r 3 — 4rN + 4rm — 2a 2 r — 2na 2 . 

If n 7^ then r ^ 0, and so iV = r 2 + m — ^a 2 — ^na 2 . 

h(r, N) = r 2 — N + ar + 6 = r 2 — (r 2 + m — |a 2 — ^na 2 ) + ar + m + na + d = 
\a 2 + ^na 2 + ar + na + d = ^a(r + n)(2r + a) + d 

The equation of interest is — (2r + a)(2r — a)(2r + a)N = h(r, N)(2r — a)h(r,N). 
Its part which commutes with a provides us with — (2r + a)(2r — a)(2r + a)N = 
(£a(r + n)(2r + a))(2r - a)(±a(r + n)(2r + a)) + d(2r - a)d = -^(2r + a)(4r 2 - 
a 2 )a 2 (r + n) 2 + (2r + a)d 2 

Therefore — (4r 2 — a 2 )iV = -^(Ar 2 — a 2 )a 2 (r + n) 2 + d 2 , which means that 
= i ^(4r 2 - a 2 )(4r 2 (r 2 + m - \a 2 - ±na 2 ) + a 2 (r + n) 2 ) + d 2 = i ^(4r 2 - 
a 2 ) (4r 4 + 4r 2 m - 2a 2 r 2 - 2rna 2 + a 2 r 2 + 2a 2 rn + a 2 n 2 ) + d 2 = (Ar 2 - a 2 ) (Ar 4 + 
4r 2 m — a 2 r 2 + a 2 n 2 ) + d 2 . 

Consequently 16r 6 + (— 8a 2 + 16m)r 4 +(— a 2 (Am— a 2 ) +Aa 2 n 2 +Ad 2 )r 2 — a 4 n 2 = 0. 

If n = then = 4r 3 - 4riV + 4rm - 2a 2 r = r(4r 2 - AN + Am - 2a 2 ), 
which means that either r = or iV = r 2 + m — \a 2 . If r = then a 3 N = 
(—N + m + d)(—a)(—N + m + d). Taking only the part which commutes with a 
we obtain a 3 N = —(—N + m) 2 a + ad 2 , hence a 2 N = —N 2 + 2mN — m 2 + d 2 , and 
consequently = N 2 + (a 2 — 2m) N + m 2 — d 2 . 

li N = r 2 + m — \a 2 then h(r, N) = r 2 — N + ar + b = r 2 — (r 2 + m — 
\a 2 ) + ar + m + d = \a 2 + ar + d = \a(2r + a) + d. From — (2r + a)(2r — 
a)(2r + a)N = h(r, N)(2r - a)h(r, N) we obtain -(2r + a)(2r - a)(2r + a)N = 
(|a(2r + a) + d)(2r — a)(|a(2r + a) +d). Taking the part which commutes with a 
we get -(2r + a)(2r - a)(2r + a)N = \a 2 (2r + a) 2 (2r - a) + (2r + a)d 2 . Therefore 
-(4r 2 -a 2 )iV = ia 2 (4r 2 -a 2 )+d 2 , hence = i(4r 2 -a 2 )(4(r 2 +m-|a 2 )+a 2 )+d 2 = 
|(4r 2 — a 2 )(4r 2 + Am — a 2 ) + d 2 and consequently = 16r 4 + (—8a 2 + 16m)r 2 — 
a 2 (4m-a 2 ) +4d 2 . □ 

Remark 3.2. This solution has no computational advantage over Huang and 
So's solution in [3], since both require solving cubic equations with the variable r 2 
where r = ^I(zq). Still, it is good to have this as a result of Theorem \2.1\ 

A. Pure imaginary roots of a quaternion standard polynomial 

As in Section El given a polynomial f(z) there exists unique polynomials g(r, N) 
and h(r, N) such that f(z) = g(r, N)x + h(r, N). Here we are interested in finding 
only the pure imaginary roots. So we substitute r = and then x = z, and by 
writing g(N) and h(N) instead of g(0,N) and h(0,N) respectively, we obtain 
f(z)=g(N)z + h(N). 

In particular, deg(g) = [ deg /~ 1 J and deg h < |_H2~J • 

The following corollary is a special case of Theorem 12.11 
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Corollary 4.1. Given pure imaginary element Zq G Q such that —z\ = N G F , 
Zq is a root of f(z) if and only if one of the following 

(1) iVo is a solution to both h(N) = and g{N) = 0. 

(2) iVo is a solution to the equation —g(N)g(N)g(N)N = h(N)g(N)h(N) and 
z = -g(N )~ 1 h(N ). 

Proposition 4.2. The number of roots of f(z) in V is infinite if and only if 
h(0, N) = and g(0, N) = have a common solution. 

Proof. If h(N) = and g{N) = have a common solution N then every element 
zq G Q satisfying — Zq = N is a root of f(z). 

If h(N) and g(N) have no common root, and Zq is a root of f(z) satisfying 
— z^ = N , then iVo is of neither h(N) nor g{N). On the other hand, iVo is a 
solution to the equation g(N)g(N)g(N)N = h(N)g(N)h(N). The degree of the 
left-hand side of this equation is 3deg(g) + 1, while the degree of the right-hand 
side is 2 deg(h) + deg(g). There is an equality only if 2 deg(g) + 1 = 2 deg(h), but 
that can never happen, therefore the equation is not trivial, which means that by 
splitting the equation into four (according to the structure of Q as a vector space 
over F) we have at least one nontrivial equation. Consequently, the number of 
roots of this system is finite, and therefore the number of pure imaginary roots 
of f(z) is finite. □ 

Remark 4.3. If z = x £ V is a root then N{g{0,N Q ))z o = -g(0,N )h(0,N ). 
Since 9R(zo) — 0, we obtain = ?R,(—g(0,N )h(0,N )). In many cases this is an 
adequate indication for checking whether a given element z® G V is a root of f(z). 

Corollary 14.11 enables us to transform the problem of finding pure imaginary 
roots of a polynomial deg(/) into the problem of finding the intersection of the 
roots of up to four cental polynomial equations of degrees up to l| deg/ — | if 
deg(/) is odd, or up to l|deg/ — 1. 

In the particular case of cubic case it especially useful, as we shall see in the 
next section. 

5. Solving cubic quaternion equations with at least one pure 

imaginary root 

Lemma 5.1. For any division algebra D, and polynomial p(z) G Di\z\, if z ^ a 
is a root of f(z) = p(z)(z — a) G D L [z] then = Zq — (a + b)z + ba for some root 
b ofp(z) 

Proof. According to Wedderburn's method, b = (zq — o)zq(zq — a) -1 is a root of 
p(z). Hence b(z — a) = (z$ — a)z, i.e. = Zq — (a + b)zo + ba. □ 

Remark 5.2. If the decomposition to linear factors of a given polynomial f(z) G 
is known, then the question of finding its roots becomes (inductively) a 
sequence of quadratic equations one has to solve. If D is a quaternion algebra 
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then quadratic equations are solvable and so one can obtain the roots of any 
standard polynomial if he knows its decomposition into linear factors. 

Remark 5.3. If f(z) = p{z)(z — a) and all the roots of p(z) are known then the 
question of finding the roots of f(z) becomes a set of quadratic equations. If D is 
a quaternion algebra then quadratic equations are solvable and so one can obtain 
the roots of f(z) if he knows the roots ofp(z). 

Let f(z) be a quaternion standard cubic polynomial. The equation —g(N)g(N)g(N)N = 
h(N)g(N)h(N) (from Corollary 14. If) is with one central variable N and is of de- 
gree 4 at most, and therefore its roots can be expressed in terms of radicals, which 
means that the pure imaginary roots of f(z) can also be expressed in those terms. 

Assume f(z) has one such root a, then f(z) = p{z)(z — a). The polynomial 
p(z) is quadratic and therefore its roots can be traced. Consequently, p(z) can 
be fully factorized into linear factors and so is f{z). Furthermore, according to 
Lemma [5 ■![ the roots of f(z) are at hand. 

In conclusion, we have a routine for finding all the roots of a quaternion stan- 
dard cubic polynomial as long as it has a pure imaginary root. 

Remark 5.4. It may happen that f(z) has no pure imaginary roots in Q, but it 
has in some restriction of Q. For example, if Q = (— 1, — 1)2,q = Q[hj '■ i 2 — 
j 2 = —l,ij = —ji] then the equation z 3 + 3i = has no pure imaginary root in 
Q but does have pure imaginary roots in Q<8>R = EI. In case that f(z) has no 
pure imaginary root in Q but has such a root in some division algebra D = Q®K 
(where K/F is some field extension) then we shall apply our method to find its 
roots in D. The roots of f(z) in Q are of course a subset its root in D. 

5.1. Example. Consider the polynomial f(z) = z 3 + (2 + ij)z + i — j e Hbj,[z]. 

g(0, N) = —N + 2 + ij and h(0, N) = i — j. They have no common root, so 
we turn to solve -g(0, N)N(g(0, N))N = h{0, N)g(Q,N)h(0, N) i.e. —(—N + 2 + 
ij)((-N + 2) 2 + 1)N = (i - j)(-N + 2 - - j) 

— (—N + 2 + ij)(N 2 -AN + 5)N = (-N + 2 + - - j) 

— (iV 2 — AN + 5) AT = -2 
N 3 -AN 2 + 5N -2 = 

In general this equation could be split into up to four equations correspondingly 
to the dimension of Q over F . However, in this case, iV 3 — 4iV 2 + 5iV — 2 is 
absolutely real and has no imaginary part, which means that we have to solve 
only one cubic real equation. 

Therefore either iV = 1 or N = 2. According to Theorem l2.ll the corresponding 
roots are -g(0, JV) -1 ^, N), i.e z x = - ij)(i - j) = -\{i - j - j - i) = j 
for N = 2 we have z 2 = — j) = — i — j. 

Consequently f(z) = p(z)(z — j). Next goal is to calculate p(z). 

Remark 5.5. Let us be reminded of how to decompose f(z) to p(z)(z—a) knowing 
a: 
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f{z) = z n + c n _ x z n ~ x + • • • + c 
f(a) = a n + c^ia™- 1 + • • • + c 

f(z) = /(*)-0 = /(z)-/(a) = (^-a") + c n _ 1 (z"- 1 -a"- 1 ) + --- + c 1 (z-a) = 
((z™" 1 + az n ~ 2 + ■■■ + a"" 1 ) + c n .x{z n - 2 + ■■■ + a"" 2 ) + • • • + a)(z - a) 
p(z) = (z 71 - 1 + az n ~ 2 + ■■■ + a"" 1 ) + c n _i(z n - 2 + ■■■ + a n ~ 2 ) + • • • + c v 

Consequently p(z) = [z 2 + jz — 1) + 2 + ij = z 2 +jz + l + ij. 

—i — j is a root of /(z) but not of z — j, hence according to Remark 15. 5\ 

(-i - 2j)H -j)H ~ 2J)- 1 = - 2j)H - + 2j) = |(-1 - 2ij + - 
2)(i + 2j) = |(-3-y)H-j) = |(3i + 3j + j-i) = |(2z + 4j) is a root of p(z). 

The second and final root of p(z) (which can be obtained using the methods) 
is i. 

Again, due to Wedderburn, p(z) = (z + i + 1 + ij)(z — i), which means that 
f(z) = (z + l+i + ij)(z-i)(z-j) 

Let z be some root of f(z). According to Lemma I5.1[ since i is a root of 
p(z) and is different from |(2i + 4j), zq must correspond to it, which means that 
Zq — (j + i)z + ij = 0. j is a root, however it is already known to be a root of 
f(z) so we look for the other one. Let t = zq — j and so t 2 — it + tj = 0. Let 
r = t^ 1 and so 1 — ri + jr = 0. r = C\ + c(l + c^j + Cy-zj, so we obtain the following 
linear system 

(1) 1 + Ci-Cj = 

(2) -ci + cy = 

(3) -c»j + ci = 

(4) Cj - Ci = 

This system has no solution. Therefore, /(z) has no roots besides j and — z — j. 

Remark 5.6. If we followed the numerical approach of [4J then we would have 
to solve an equation of degree 6 over the center. In general, such an equation is 
not analytically solvable. 

6. A NOTE ON QUADRATIC TWO-SIDED POLYNOMIALS 

A two-sided polynomial is a polynomial of the form z n + a n -\z n ~ l b n -\ H — • + 
aizbi + c. In [5j Janovska and Opfer provided an example of a quadratic two-sided 
polynomial with more than two essential roots. 

However, two is the upper boundary of the number of pure imaginary roots of 
such a polynomial. 

Proposition 6.1. The number of pure imaginary roots of f(z) = z 2 + azb + c, 
assuming a,b,c ^ 0, is at most two. 

Proof. Let zq be a pure imaginary, then — Zq = Nq e F. Therefore —Nq + az^b + 
c = 0, i.e. Nq — c = azob, hence a _1 6 _1 A^o — a _1 c6 _1 = Zq, which means that 
a-^a-^N 2 - (a^b^a^cb- 1 + a^dr^b-^No + a~ x dT x a~ x cb~ x = -N . 
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Consequently, N is a root of the non-trivial polynomial p(N) = a~ 1 b~ 1 a~ 1 b~ 1 N 2 + 
(1 — a^b^a^cb^ 1 — a~ 1 cb~ 1 a~ 1 b~ 1 )N + a" 1 dr l a -1 dr l . Hence, the number of 
pure imaginary roots of f(z) does not exceed 2. □ 
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